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A GENERALIZATION OF THE TRIE DATA STRUCTURE 



Richard H. Connelly and F. Lockwood Morris 

Abstract. Tries, a form of string- indexed look-up structure, are generalized to per- 
mit indexing by terms built according to an arbitrary signature. The construction 
is parametric with respect to the type of data to be stored as values; this is essen- 
tial, because the recursion which defines tries appeals from one value type to others. 
"Trie" (for any fixed signature) is then a functor, and the corresponding look-up 
function is a natural isomorphism. 

The trie functor is in principle definable by the "initial fixed point" semantics 
of Smyth and Plot kin. We simplify the construction, however, by introducing the 
"category-cpo" , a class of category within which calculations can retain some domain- 
theoretic flavor. Our construction of tries extends easily to many-sorted signatures. 



Section 1. Introduction. 

A look-up table — a finite data structure intended for the retrieval of values which 
have been stored corresponding to "keys" — is naturally regarded as a concrete im- 
plementation of what abstractly is a function from keys to values, but just what 
sort of function deserves some consideration. If the value type is A, which for 
convenience we shall think of as always containing a distinguished element or base 
point *a — we call such a type a pointed set — and the key type is Y , we may say 
that a table models a function from Y to A whose value is *a (representing the 
absence of a genuine value) for all but finitely many arguments; we introduce the 
notation for the set of all such functions. 

Tries [4,6] are a form of look-up table suited to the situation where keys are 
strings over a finite alphabet. Our innovation here will be to extend the possible 
sets of keys from "strings over any finite alphabet" to "terms built with any finite 
signature of operators" . We begin by giving a description of ordinary string-indexed 
tries (omitting optimizations found in more practically-oriented treatments) in such 
a way as to make the generalization to indexing by terms as obvious as possible. 

Let H be the set of finite strings over a finite alphabet {ci, . . . ,c m _i} for any 
m > 1. (To be sure, if m is one, H contains only the empty string, but there is 
no reason to forbid that case.) An iJ-indexed, A-valued trie is a finite (m — l)-ary 
tree, its nodes labelled by elements of A. (We shall be wanting to consider tree 
nodes as m-tuples, with the label as mth component; this accounts for our taking 
the size of the alphabet to be m - 1.) A trie is searched by the evident recursive 
principle: the empty (m — l)-ary tree, which we denote by • (pronounced "spot"), 
represents that function in A^ whose value is *a for every string, while an m-tuple 
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(ri, . . . , r m _i, a) represents the function whose value at the empty string is a and 
whose value at a non-empty string Cih is the value at h of the function represented 
by Ti. 

To put this more formally, we may regard H as a term algebra: the set of all 
terms which can be built with ra — 1 unary operators ("prefix a") and one miliary 
operator ("empty string"). [This is more often called a "word algebra"; we go 
against convention because we shall be wanting "word" in a distinct technical sense 
in Section 4.] To make a specific construction, we may, following Reynolds [9], 
define H as the least solution of the set equation 

ra 

H = J2 ( if * < m then h else {( )}) 

where by the summation notation we intend the following specific disjoint union of 
sets: 

J2%i = {(h z) \ 1 <i <m and z G Zi}. 
i=i 

Here {()} is a convenient one-element set— we take the zero-tuple for its element 
in the same spirit that we say the empty string is a miliary operator. Then, as is 
well known, H is explicitly given by H — (J n i7 n , where H 0 = 0, and for n > 0, 

m 

H n+ i = J2 Of * < m *hen if n else {()}). 

z=l 

(A remark on notation: We will be doing a lot of indexing from zero to infinity; 
we therefore will write things like [j n H n as shorthand for [j n>0 H n .) Given this 
construction of if, we can write an explicit recursive program for the look-up or 
"apply" function: 

ap • h = *^ 

ap (n, . . . , r m _i, a) (ra, ( )) = a 

ap (ri, . . . , r m _i, a)(i, /i) = apr^ h for 1 < i < ra. 

Before leaving the case of strings as keys, we give a more rigorous, and also 
slightly more restrictive, definition of the set of if-indexed, A-valued tries. Note 
that our first description, that a trie was any ^-labelled (ra — l)-ary tree, allowed 
the everywhere-*^ function to be represented not only by •, the empty tree, but 
also by any tree all of whose node labels were *a- We now decide to require that 
• be the only allowed representation for this function, so as to get a one-to-one 
correspondence between the functions in and their representing tries. To this 
end we introduce a modified Cartesian product Yl 9 (called "spot product" , if this 
is not too cutesy) defined for any ra pointed sets by 

m A f m 

n.^ = n^-{K *a.)}u{.} 

1=1 i = l 

with • taken to be the base point of the resulting set. 
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Now we can say that our set of tries is the least solution to the set equation 



m 



R = l\ 9 (if i < m then R else A) 



and is given explicitly by 



Ro = {•} 



R n +l = EL(tf * < m *^ en e ^ Se A) 

i=l 



R — (J Rn • 



n 



The reader will have noticed the parallelism between the constructions of R and 
H, and may foresee that the one-to-one correspondence between R and will 
prove to be a consequence of a "law of exponents" which we give as: 

Proposition 1.1. If X\, ... , X m are sets, m > 0, and A is a pointed set, then 
there is a one-to-one correspondence 



m 



given by 

■ • • -,9m) •-»■ A(t,z) : /»-► (Aa?i./(l,xi),...,Ax m ./(m,x m )). 



Proof. We know that the formula for /x m gives a one-to-one correspondence for 
unrestricted functions, /x m : A Xl x • • • x A Xm = A Xl +*" +Xm ; j s ^ e CO prod- 
uct property of the m-ary disjoint union in Set. We have only to notice that 
this correspondence cuts down to the almost- everywhere-*^ functions: we have 
(<7i, . . . , g m ) G A^ Xl ^ x • • • x A^™] if and only if, for z = 1, . . . , m, each x» 7^ *a 
just for some finite set of values of ar», say for x» € . . . ,Xirn}] this is the same as 
to say that /x m (<7i, . . . , 9m)(h z ) = (Mh z )-9i z ){h z ) is different from *^ just for (z, z) 
one of the finite set of values {(1, #11), . . . , (1, x\ ni ) y . . . , (m, ar m i), . . . , (m, x mnm )}, 
i.e., that /x m (pi, . . . , gm) € □ 

For any set Y and pointed set A we take *a\y\ to be the constant function Ay.*^. 
We may then observe further that the correspondence /x m is base point preserving. 
Also, there is an evident base-point-preserving, one-to-one correspondence between 
the modified and the ordinary Cartesian products of A± Xl \ . . . , A^ Xm \ Composing 
these correspondences, we may record for later reference: 

rn 

Definition 1.2. Denote by : JJ # A^ Xi ^ = ^EI^i^J the base-point-preserving, 

1=1 

one-to-one correspondence 



• 1— > A(i, z).*a 
(9i,-->,9m) »-> \{i,z).giZ. 
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We have just seen string-indexed tries presented with strings taken to be the 
elements of a particular term algebra. We now generalize to keys which are the 
elements of an arbitrary term algebra, first one-sorted and then, in Section 4, many- 
sorted. A term algebra is characterized by its operators, say m of them; the only 
significant property of each operator is the number of operands it expects (its arity), 
a non- negative integer ki for i — 1, . . . , m. Thus, following Reynolds [9], we may 
take our generic one-sorted term algebra to be the least set T solving the equation 



m, 
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namely 

T = |Jr n , where T 0 = $ and T n+ i = £T*'. 



n 



The key to generalizing the trie idea, still obtaining a set in one-to-one corre- 
spondence with A\ T \ in the presence of operations of unrestricted arity is another 
law of exponents. 

Proposition 1.3. If Xi, . . . , Xk are sets and A is a pointed set, then there is a 
one-to-one correspondence 



given by 



Vk : 9 »-> . . .,Xk).gxi • < -x k , v h x : f i-> Xxi • • • Xx k .f{x u . . .,£*.). 

ryi 

Proof. We treat first the case fe = 2, that is z/ 2 : 

A [Y)w & A^ XxY l For functions 
without restriction, this is the "uncurrying" isomorphism, part of the Cartesian 
closed structure of Set. Again, we have only to check that it cuts down to the 
almost- everywhere-* functions. Suppose / = V29, and suppose that / G A^ XxY \ 
i.e., that for some N we have f(x, y) / *a only for (x, y) G {(xi, yx), . . . , (x^v? 2/at)}. 
Then gx / on ly f° r ^ m the finite set (possibly with fewer than N elements) 
{#1, . . . , xjsr}, and even when x is one of these, gxy ^ *a only for (x, ?/) in the set of 

iV pairs, so that gx G in all cases, and g G A^ . Conversely, if g G A^ , 

then gx / on ly f° r x i n some finite set {x 1? . . and for i = 1, . . . , M, 

2/ *A only for, say, y G . . . , y in J, which is to say that z/ 2 gr(x, j/) / *a 

only for the m H \-n M pairs (xi,yn), . . . , (x M ,y M n M )) s0 that ^2^6 A^* 1 ^. 

Now we may treat general k > 0 by induction. For k = 0, it is immediate that 
: A = A^^, where z/ 0 a = A{ ).a, and f^ 1 / = /(). Then supposing inductively 

that any ^ has been shown to be a one-to-one correspondence, we may observe 

that z>%+i is the composition 

A [X k+1 ]-l x *l l 1J Vt^ ^ 2 x-xX H1 ]^l A [Xxx(X 2 x-xX k+1 )] 

Xf.fo\(x 1 ,...,x k+1 ).{xi,(x 2 ,...,x k+1 ))^ .[X 1 x-xX k+1 ] 
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of which each step is a one-to-one correspondence (the instance of v 2 in the middle 

rv 1 

step is with X = X 1: Y = X 2 x • • • x since following any g e A^ Xk ^ "' 

from left to right we find 

g i-> \x 1 \(x 2 , . . .,x k +i).gxi ■ • -z/e+i ^ A(a;i, (x 2 , . . . , Xk+i)).gxx • • -Xfc+i 



A generalized trie has to represent a function whose values are functions. That 
is, an ordinary trie and its sub-tries all represent elements of A^ H \ whereas a gen- 

eralized trie and its parts have to represent elements of A^ T \ A^ = A^ TxT \ 

etc. But we can think of A^ , for example, as two iterations of "— t T ^' at the 
set A; so if we can abstract on the type A, and regard the generalized trie idea as 
a scheme for representing functions from T to any type, then we should be able to 

iterate this scheme twice at A to get a representation of A^ . This motivates 
the following definition of the set of T-indexed, A- valued (generalized) tries, where 
now A is an explicit parameter ranging over pointed sets: 

R 0 (A) = {.} 

R n+1 (A) = f\ m R^\A), 

i=l 

here Rn (A) denotes R n (- • • (R n (A)) . . . ) with ki iterations of R n , and 



R(A) = \jR n (A), 

n 

with = • independent of A. When ki — 1 for i < m and km = 0, R(A) 

reproduces the string- indexed tries as previously defined (that is, R(A) = i£). 

As an example term algebra let us take binary trees. We simulate a one-bit 
label on each node by providing two binary constructors; that is, we take m = 3, 
fa — 0 (to construct the empty binary tree), k 2 = k$ = 2; we may express this 
more comprehensibly as the recursive set definition 

T B = {()} + T B 2 + T B 2 . 

Then the Te-indexed, A-valued tries have the corresponding recursive definition 

R B (A) = Ax R b (R b (A)) x R B (R B (A)) . 

def 

To actually make an example of a trie, let Z* = Z U {*}, that is the inte- 
gers extended with a base point; then the function from Tb to Z* that maps 
the one-node binary tree (2, ((1, ()), (1, ()))) to 7 and the two-node binary tree 
(2, ((1, ()), (3, ((1, ()), (1, ()))))) to 8, everything else to *, is represented by the trie 



(*, ((7,., ((8, •,•),•,•)), •,•),•). 
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Returning to the general treatment, it seems intuitively reasonable to suppose 
that the following equations define a family of look-up functions, also parameterized 
by the pointed set of possible values, and similar to what we had in the string- 
indexed case but more recursion-intensive: 

apA • t = *A 

ap^ <ri,...,r TO )(i, (ti,... ,tfc.» 

= ap A (ap#( A )(- • • (ap fl ( fci -i) (A) r» h) * • • ) t ki -i) t ki . 

Note that values of i for which ki = 0 correspond to nullary operators, that is 
constants, of the term algebra; for such i we have 

apA (ri,...,r m )<i,()> = r<. 

It is by encountering a nullary subterm that ap is able to take a step towards 
escaping from its apparently ever-more-deeply-nesting recursion. 

It may be easier to make sense of ap specialized to our binary-tree-indexed 
example, which comes out as 

B-ap A • b — *a 
B-ap A (ri,r 2 ,r 3 )(l,()) = r x 

B-ap A <r 1? r 2 ,r 3 )<2, (6i,6 2 )) = B-ap A (B-ap^ B(A) r 2 b 1 )b 2 
B-ap A (ri, r 2 , r 3 )(3, & 2 » = B-ap A (B-ap HB(A) r 3 61) 6 2 . 

The reader may care to verify that B-ap z ^ actually will return 7 and 8 from the 
example trie for the appropriate two binary trees as keys, and * for other keys. 

We will prove, by the end of Section 3, that this ap actually is well defined. 
(In reality, of course, one wants to implement ap as a single subroutine, not an 
infinite family. Apparently, all ap really needs to know about the type at which 
it is supposedly working is the relevant base point which it might have to return. 
Thus a practical program might be 

ap'(6p, •) t = bp 

ap'(6p, (n, . . . , r m »(i, (<i, ...,t hi )) = ap'(6p, ap ; (», • • • (ap ; (», n) t x ) • • • ) t ki -i) t k . . 

In situations where it can be arranged that • and * A are identical, even the pa- 
rameter bp would be unnecessary. We shall, however, not pursue this line further, 
preferring to keep the value type as a parameter.) 

Stated in terms of sets, what we hope to establish, in order to show that the 
generalized tries really do represent finite functions, is a one-to-one correspondence 
R(A) = for every pointed set A. In outline, the proof goes as follows: For 
any set Z, write T% for the set-to-set mapping A 1— > AS Z \ then Proposition 1.1 and 
Proposition 1.3 will yield 

Y[.Tp\A) - zki) (A) for all A. 

7-1 
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Then we may hope to prove by induction that for all n > 0, 

R n {A) ^ T Tn {A) = A [T " ] . 
For n = 0, {•} = A^; as an inductive step, calculate 

i=l i ~ 1 1 

One would then like to conclude that, in the limit, 

R(A) = \jR n (A) 2i A^Tn] = A [T]_ 

n 

Making this calculation rigorous, and showing that the family of one-to-one corre- 
spondences R(A) = it yields is in a suitable sense the least fixed point of the 
recursion equations for ap, will be the purpose of the following two sections, with 
assistance from the appendix, where we have segregated such necessary definitions 
and theorems as belong entirely to category theory. 

Not surprisingly, we are able to view the T introduced above as a (Curried) two- 
argument functor, having its second argument and its result in the category Set* 
of pointed sets and base-point-preserving functions, but taking its first argument 
(the Z in A^) from the partial order of sets and set inclusions. By this choice 
for its domain, we are able to define T so as to be covariant in both arguments, 
and are spared the difficulties which led Smythe and Plotkin in [11], needing a 
covariant arrow bifunctor within a category of domains, to introduce a subcategory 
with "embeddings" as morphisms. Nevertheless, T bears some resemblance to an 
exponential functor, and the isomorphism of Proposition 1.3 is much the same as 
that which gives Set* with smash product as a tensor product — or equivalently the 
category of sets and partial functions with Cartesian product as tensor product — its 
monoidal closed structure (see, for example, Poigne [8]). 

The function R which constructs for any A the T- indexed, A- valued tries is in 
fact an endofunctor of Set*. Moreover, ap is a natural isomorphism from R to the 
functor — t T J, as we shall show. The formula given above for the construction of 
each R n +i from R n amounts to the definition of an endofunctor TZ of Set^ et * , of 
which R is an "initial fixed point" as defined by Smyth and Plotkin [11]. 

It is because the definition of ap^ appeals to &Pr(a)i etc. — in other words, be- 
cause ap must be polymorphic to work at all — that we are compelled to look for a 
whole functor, rather than a single data type, to be an initial fixed point. 

It was originally our intention to carry out the rigorous construction of R and the 
natural isomorphism ap as a straightforward application of the Smyth and Plotkin 
method, which generalizes the familiar "least fixed point of a continuous function" 
construction from domain theory to a categorical setting. However, it has turned 
out that, despite the presence of categories and functors, the trie construction re- 
tains a very domain-theoretical flavor, because the sets R(A) are unions of inclusion 
towers, as is also the set T of terms. It therefore has seemed to us worthwhile to 
make a preparatory digression, introducing in Section 2 a kind of domain-category 
hybrid, the "category-cpo" . This will allow our desired natural isomorphism of 
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functors to be constructed (in Section 3) as the least upper bound of an ascending 
chain rather than, as would be done by a more general category- theoretic treatment, 
as the colimit of a general u;-sequence of objects and morphisms. 

There seems to be a growing recognition that category theory is relevant not 
only to semantics, but to the more mundane algorithms-and-data-structures side of 
computer science. Spivey [12], for example, uncovers natural transformations and 
adjunctions in familiar list-processing functions. It may not, however, be generally 
appreciated that the construction of even a first-order data type can, as here, call 
for categorical methods. The present paper uses rather a lot of mathematics to 
arrive at a modest algorithmic result, but we hope that some of the tools developed 
here will be reusable in other applications. 

Section 2. The notion "Category-cpo". 

It is a commonplace observation (see for example [7, p. 11]) that a partial or- 
der may be regarded as a category in which each hom-set contains at most one 
morphism, and which moreover is skeletal: isomorphic objects are identical (this, 
together with the uniqueness of morphisms, entails that the only isomorphisms are 
the identities). For a category which in this way "is" a partial order, C, on its 
objects, we will, when a and b are objects such that a C 6, write (a C b) as a 
notation for the (unique) morphism. 

Definition 2.1. A category -partial order (category-po for short) is a pair (if , 
where K is a category and □ is a subcategory of K which is a partial order on all 
the objects of K, and such that the identities are the only morphisms of C that are 
isomorphisms in K. 

The morphisms of the subcategory □ will be called the "inequalities" of K. Note 
that this usage of "inequalities" includes also "equalities", i.e., identity morphisms. 
The last condition in the definition is equivalent to requiring that the insertion 
functor i : C — > K reflect isomorphisms [7, p. 150], since no non-identity is an 
isomorphism in C. We will never have occasion to consider more than a single 
partial-order subcategory of any one category K\ hence, by abuse of notation, we 
will generally write just K and not {K, C) as our name for the category-po. 

In any category-po (if, C) a partial order is induced on all the morphisms of 
if, as follows: if / : a — > b and g : c — > d, we write / □ g (using the same 
partial order symbol as between objects) just in case a C c, b C d, and we have 
the commutative square 

a C c 

w 4 i 9 

Verification that C between morphisms is a partial order is immediate. This partial 
order is identical with the comma category Since i reflects isomorphisms, so 
does the insertion from i[i to Ik [Ik- Consequently, Ik [Ik is also a category-po. 
It is also immediate, for any objects a and 6, that a C b if and only if l a C 1^. 

Composition in a category-po is monotone where defined: if a — ► b — ► c and 
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a! — ► b' d are such that / C /' and g □ then g o / □ g ( o /', as one sees by 
pasting the squares together. 

A noteworthy elementary fact is the following: 

Fact 2.2. In a category-po, if f,g\a — > b with f C g, then f = g. 

This is because the commuting square by virtue of which it holds that / C g 
must have l a and 1& for sides. (This observation makes it clear that category- 
pos are very unlike typical categories of domains and continuous functions: in a 
category-po, all the non-trivial instances of "approximation" of one morphism by 
another must be between morphisms from different hom-sets. A motivation for the 
development of category-pos may be taken from the familiar observation that if / 
and g are (graphs of) two total functions with the same domain, then / C g implies 

f = g-) 

We now introduce the principal notion with which we intend to work. 

Definition 2.3. A category-po (K, Q) is a category- complete partial order (for 
short, a category-cpo) if C is uj-complete (that is, every ascending u-chain of objects 
has a Lu.b.) and each such Lu.b. is an u-colimit in K. 

The requirement that each a;-l.u.b. be a colimit is identical to requiring that the 
insertion functor i : □ — > K preserve cj-l.u.b.s as colimits. (We should remark 
that a category-cpo is a special case of a "double category" as defined by MacLane 
[7, p. 44], since the instances of C between morphisms are certain commutative 
squares in K\ however, we do not know how to apply this observation.) 

Notational remark: We use the tuple brackets (— ) , with a binding occurrence 
of n indicating as with (J and the like that n runs from zero to infinity, as a notation 
for infinite sequences and especially ascending chains. For example, as a synonym 
for "sequence &o E &i E &2 E • • • of objects" we may write "a;-chain of objects 

The next lemma shows that the insertion functor i : □ — ► K reflects [7, p. 150] 
colimits of a;-chains. 

Lemma 2.4. Given a category-cpo K, let k be an upper bound of an u-chain 
ko E ki C • • * of objects of K. If ((& n C A;)) is a colimit cone, then k = |J n k n . 

Proof. Since (|J n k n C k) mediates from colimit cone ((fc n C LL^)) n ^° cone 

((k n E k)) n , it must be the unique mediating morphism (u.m.m. for short). Since 

((k n Q fc)) is also a colimit cone, (|J n k n □ k) is an isomorphism and so must be 
an identity. □ 

The product category of a family of category-epos is a category-cpo using the 
componentwise ordering and Lu.b. The following proposition shows that the mor- 
phisms in a category-cpo are uncomplete. 

Proposition 2.5. Let K be any category-cpo. The morphisms of K are uj- 
complete using the ordering defined by Diagram 1. Specifically, for any cu-chain 
(U : (hi — > b n ) n , there is a Lu.b. \J n f n \ [_\ 7l a 7l — ► |J n b n . 



Proof. Consider (f n ) as a sequence ({a n ,b n , f n )) in Theorem A. 4 gives that 
P : %{% — ► K x K creates a colimit object {[J n a n , |J n b n , /). The morphism / is an 
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upper bound of (/n) , because the created colimit cone is (((a n □ U m a ™)> i^n E 
Um ^° see ^ * s ^ e l* u -k-> let 5 : c — ► d be any upper bound of - 

Since (c, d, g) is a vertex of ((a n , b n , /n)) ? Fact A.l applied to the functor P gives 
that ((Un a ™ E c),(|J n fe n C d)) is the u.m.m. from (U n a mU n b ^f) to ( c > d »flO- 
That is, / C 0- □ 

Fact 2.2 gives the following useful fact. 

Fact 2.6. If g : Un a ™ — * Un b n is an upper bound of an uj-chain of morphisms 
(f n : a n — ► fc n ) n , then g = U n / n . 

It follows from Fact 2.6 that l|j ttn = Un l<*n f° r a o E a i E a 2 E * * • • It also 
follows that composition of morphisms is ^-continuous: by monotonicity, |J n g n o 
Un fnHgn° fn for each n; therefore, by Fact 2.6, Un 9n ° Un fn = Un(^ ° /n)- 

For any category-epos L and we say that a functor F : L — ► K is continuous 
if and only if F preserves inequalities [ F(a □) = (Fa C Fa) ] and is ^-continuous 
on objects [ F(U n a ™) = Un F( a n) ]■ ("Continuous" seems the only reasonable word 
to use in our context. Note, however, that this is not ^-continuity of functors as 
ordinarily defined, that is preservation of all cj-colimits.) 

Any continuous functor F is ^-continuous on morphisms: First, F is monotone 
on morphisms: an instance of C between morphisms, say 



a C 



a 



f 



f 



b C 6, 



is sent by i* 1 to 



Ff 



Ff 



Fb F l. 



Then, for any a;-chain of morphisms (/ n : a n — ► b n ) , monotonicity gives that 

F(U n fn) 3 F(f n ) for each n. Since F(\J n f n ) : /(U n o n ) = |J n F(a n ) — 

^(LL 6n) = U. *X&»), Fact 2.6 gives F(|J n / B ) = |J n F(f n ). 

For any continuous functors F,F* : L — ► K, let r : F — ■+ be any natural 
transformation. For any a C a in L, we have the commutative diagram 



Fa — — — ► 



Fa 



F'a 



F'(C)=C 



That is, a C a implies r a C r s . For an u;-chain (a n ) i n it follows that ry^ an □ 
r an for each n. Since the domain of t]j a „ is ^(Un a ^) = Un^( a ^) an< ^ the 
codomain is F'(\J n a n ) = Un^'( a ™)> Fact 2 - 6 S ives r U ~ UnW That is > an Y 
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natural transformation between continuous functors is a ^-continuous map from 
objects to morphisms. 

Clearly, functor composition preserves continuity. 

Let L be any (index) category and K be any category-cpo. Define a partial order 
C between functors from L to K by 

F C F O Fl C Fl for every object I and Ff C F/ for every morphism /. 

This is the same as to say that the assignment / i— ► (Fl □ FZ) is a natural transfor- 
mation from F to F. If (F □ F) is an isomorphism, then each component (Fl □ FZ) 
must be an identity, making (F □ F) an identity transformation. Thus the trans- 
formations C as inequalities make into a category-po. The next proposition 
shows that K L is a category-cpo. 

Proposition 2.7. Let L be any category and let K be any category-cpo. The 
category K L is a category-cpo using the C ordering. Specifically, for any cu-chain 
(F n : L — > K) , the l.u.b. is (\_\ n F n )(l) = \J n Fn(l) for each object I and 
{\-\ n Fn)(h) = \_\ n ^ri(h) for each morphism h. Further, if L is a category-cpo 
and each F n is continuous, then \J n F n is continuous. 

Proof. By Fact A. 3, the functor i* : K L — > creates a colimit object F 

with colimit cone {(F n C F)) where F(Z) = Un^™(0 ^ or eac ^ °bject I. For any 
morphism h, since F(/i) □ F n (h) for each n, Fact 2.6 gives that F(fr) = |J n F n (h). 

Let G be any upper bound of (F n ) . Since C : i*(F) — ► is the u.m.m. in 

if l L ', Fact A.l gives that it is also the u.m.m. in K L . That is, F r G and F is the 
l.u.b. U n F n . 

Now let L be a category-cpo as well and suppose that each F n in the cj-chain 
(F n : L — > K) is continuous. It needs to be shown that [_\ n F n is continuous. To 
show that \_\ n F n preserves inequalities, let / C I, and calculate 

(U F n)(/ E 0 - U F »(* E 0 = U( F ^ E FnO" 

n n n 

Since ((|_| n -^n)(0 E (LL -^0(0) * s an u PP er bound of the sequence of inequalities 
(F 0 lQF 0 l) □ (Fi/C-FiO C • • • , Fact 2.6 gives us that also 

((U^»)(0 - (LK)(o) = U( F » Z - 

n n n 

To show ^-continuity on objects, let Iq C li □ ■ - * be an u;-chain in L\ then 

(U (U '«) = U ^ (U = U U ^(U = U U ^(U = U (U F n) Vm). 

n m n m n m m n m n 

□ 

If K and L are both category-epos, we henceforth denote by K L not the cat- 
egory of all functors from L to K but the full subcategory thereof whose objects 
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are the continuous functors. Exponential objects given by this definition of K L , 
together with the products noted above, may be shown to make the category of 
small category-epos and continuous functors Cartesian closed. 

Let K, L, M be any category-epos. The composition functor o : M L x L K — ► 
M K [7, Exercise II. 6. 3, p. 45] is continuous. To see this, again treating preservation 
of inequalities first, calculate, for any k G K (see [7, p. 43 eqn. 2] for the horizontal 
composition of natural transformations): 

[{H tH)o(Gt G)]k = H(Gk □ Gk) o (H(Gk) C H{Gk)) 

= (H(Gk) C H(Gk)) o (H(Gk) C H(Gk)) 
= (H(Gk) C H{Gk)) 

= (H o G t H o G)k . 
Then for ^-continuity on objects (functors): 

([J H n o \J G n ) (k) = L| H n ([J G m (i)) = [J [J H n (G m (k)) = \J(H n o G n (fc)) 

n n n m m n n 

for every object fc, and 

(LK°U G n ) (/) = □ H n (□ G ro (/)) = □ □ H n (G m (f)) = \J(H n o G n (f)) 

n n n m m n n 



for every morphism /. 

Of particular use will be the n-fold composition functor o n : (K K ) — ► K K 
which by induction is continuous. (Zero-fold composition picks out the identity 
functor on K.) If we write A n for the diagonal functor, defined for both objects 
and morphisms by A n (x) — (a;,..., a;), with the result an n-tuple— this notation 
leaves the domain and codomain of A ns in every particular use of it, to be inferred 
from context — then it follows that n-fold iteration, 

_(n) d|f A 

is, for each n > 0, a continuous endo functor of K K . 

It has been noted that Ik [Ik is a category-po when K is, and the reader may 
have surmised that the same holds with "category-cpo" replacing "category-po". 
Here is a proposition that gives a more general condition under which a comma 
category is a category-cpo. First, for any category-epos L and M, any category 
if, and any functors T : L — > K and S : M — ► K, the comma category T[S 
is a category-po, where we take the inequalities ((Z, m, /) C (/", m, /)) to be the 
morphisms of the form ((/ □ /), (m □ m)). 

Proposition 2.8. Let L and M be any category-epos and let T : L — ► K and 
S : M — ► K any functors where T preserves Lu.b.s as colimits. Then the comma 
category T[S is a category-cpo. Specifically, for any u-chain ((/ n , m n , / n )) ? an d 
letting I = [J n l n and m — |Jn m ™ ^ or conciseness, the morphism f in the l.u.b. 
(/,m, /) is the u.m.m. in K from (T(l n C I)) to (S(m n □ m) °/n) • Additionally, 
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if S preserves Lu.b.s as colimits and f n is an isomorphism for each n, then f is an 
isomorphism. 

Proof. Theorem A. 4 gives that the forgetful functor P : TIS — > L x M creates 
a colimit object (Z,m, /) with colimit cone ({(In E /), (m n C m ))) n - Theorem A. 4 
also gives that / is the u.m.m. from (T(l n □ I)} to (S(m n E m) o f n ) and that 
/ inherits the isomorphism property when S preserves Lu.b.s as colimits. 

To see that (/, m, /) is the l.u.b., let (c, d, g) be any upper bound of ((Z nj ra n , /n)) • 
Since ((/ E c), (m E d)) is the u.m.m. in L x M, Fact A.l gives that it is also the 
u.m.m. in TIS. That is, (l,m, /) E (c,d,g). □ 

Here is an analog to Fact 2.2 for comma category-pos. 

Fact 2.9, In any comma category-po TIS, if {I, m, /) E (/, m, then f = g- 

This fact may be seen by realizing that the morphism ((i, m, /) E (i, m, #)) is the 
identity (1;, l m ). From Fact 2.9 follows also an analog for comma category-epos of 
Fact 2.6: If an object of the form (|_| n l n , |J n ra n , #) is an upper bound of the chain 
((in? TO n, /n)) * n ^jS, then it is the least upper bound. 

The next proposition uses Lemma 2.4 to give a condition under which a functor 
whose codomain is a comma category-cpo is continuous. 

Proposition 2.10- Given a comma category-cpo T[S where T : L — > K and 
S : M — > i£T, let Q be any category-cpo and let F : Q — ► T|5 be any functor. 
If P o F is continuous, where P : T[S — > L x M is the forgetful functor, and T 
preserves Lu.b.s as colimits, then F is continuous. 

Proof. F preserves inequalities, because P o F does and P does not modify mor- 
phisms. Let qo E Qi E • • ' be any cj-chain in Q. Since P o F is continu- 
ous, P(F(|J n g n )) is the vertex of a colimit cone ((P(F{q n )) C iWUn 9»)))) n - 
Since, by Theorem A. 4, P creates colimits, Corollary A.4.1 gives that {(F(q n ) E 
F(U n 9^))) n * s a co ^ m ^ cone - Lemma 2.4 then gives that -FflJn ~ Un ^Ofo)- d 

When if in Proposition 2.8 is a category-cpo, we may regenerate the partial 
order on morphisms by taking T — S = i : C — ► if, the insertion functor. We 
chose to introduce E on morphisms beforehand in order to be able to partially order 
functors. 



Section 3. One-sorted Tries. 

We will need four particular category-epos for our application to tries, three 
which we introduce now, and a comma category-cpo to be named later. The first 
is the category-cpo (trivially one, because it is a cpo) whose objects are sets and 
whose morphisms are only the inclusions between sets; we denote it by Seto The 
second is (Set*, C), the category-cpo of pointed sets, with an inequality taken to 
be any set inclusion A C A (provided this actually is a morphism of Set*, that is, 
provided A and A have the same base point). We will invariably call this category- 
cpo simply Set*. The third is Set^ et * , the category-cpo of continuous endofunctors 
of Set* with their natural transformations; it is here that we hope to find the trie 
functor R. For now we identify one object of Setf et * : we denote by _L. the constant 
functor that maps every object of Set* to {•} and every morphism to 
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We define the mapping of sets 

771 

T(Z)= £Z fc S 

1 = 1 

so that our word algebra T is given by T — \J n T( n \$). The monotonicity and 
continuity with respect to C of disjoint union (finitary or not) and finitary Cartesian 
product of sets are elementary facts— but for a proof of the latter, remove the spots 
in the proof of Proposition 3.3 below — hence these constructions are continuous 
endofunctors of Seto It follows, by composition of functors, that T : Setc — ► 
Setc is a continuous functor. 

We have to show that defined in Section 1 as the mapping Z h is a 

functor from Setc to Setf et * which preserves l.u.b. s as colimits, that there is a 
continuous functor 11 : Setf et * — ► Setf et * such that R n — TZ^(±,) and conse- 
quently R = U n 7^ n )(±.), that for every n > 0 there is a natural isomorphism 
<y n : Tl( n \±. m ) ^ ^(T( n )(0)), and finally that the 7 n are the morphism parts of 
an ascending u;-chain of objects in a suitable comma category-cpo. We may then 
conclude that the morphism part of the l.u.b. is a natural isomorphism 

7:i2 = |Jft< n >(±.) = -F(T), 

n 

and verify that 7 satisfies the equations given for ap in Section 1. 

The "finite functions" functor J 7 

For Z a fixed set, we may define what we will show is a continuous functor 
from Set* to Set* by the formulas, where A and B are any objects of Set* and 
h : A — > B is any morphism, A \-> A^ (taking the base point of A^ to be 
\z.*a) and h \—> (Xf.h o /), which we correspondingly denote by h\ z \ We may see 
that this is a functor by noting that its action on morphisms is that of a covariant 
hom-functor, or we may verify in detail 

l A [z] =Xf.l A of = Xf.f = l A[z] 

(h o g)W = Xf.h ogof = (Xf.h o /) o (Xf.g o /) = h™ o g™. 
For continuity, we verify that — ^ is ^-continuous on objects, i.e. that 

(U^f = lK |zl . 

n n 

because first, all terms and the l.u.b. of any o;-chain Aq C A\ C . . . (J n A n must 
share a common base point * in order for the inclusions to be base-point-preserving; 
second, if A C A', then an almost-e very where-* function from Z to A is also such 
a function from Z to A f , making — ^ monotone on objects, whence U n ^™ — 

(Un A n) , and third, if / G (U n A n y *\ then the finitely many non-* values of / 
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must already lie in some An, whence {{J n A n ) [Z] C \J n A n {z \ To show that 
preserves inclusions, calculate 

(A C A f ) [Z] = \f.(A CA')of = \fe AW.f = (AW c 

This shows that the functor — ^ is an object of Set^ et *. 

We next show that the mapping Zh-[^ is the object part of a functor J 7 : 
Setc — > Set* e *, and that T preserves Lu.b.s as colimits. Note that T preserves 
a l.u.b as a colimit, but not as a l.u.b. Also, restricting its domain to Setc allows 
T to be covariant instead of contravariant. 

We first introduce some compact notation. Let Z C Z C Z be sets and let / G 
be any function. We denote by f\ Z the function Xz G Z./(z) in A^, that is the 
restriction of / to Z, and by /]Z the function (Az G Z. if z G Z then f{z) else *a) 
in A^ z \ The following facts are immediate: 

(1) /1ZJZ^/jZ; 

(2) If {z € Z | /(*) ^ * A } C Z, then f\Z]Z = /. 

For any /i : A — ► A* in Set*, 

(3a) (hof)\Z = /io(/JZ), and 

(3b) (ho f)]Z = ho (f]Z) [because ft : *^ i— ► 

Let Zq C Zi C Z2 C . . . be any sequence in Setc- For any / € A^n z ^l, let 
rif denote the minimum index for which f\Z nf ] (J n Z n = /, that is, the minimum 
index such that f(z) = *^ for all z G (Un^) ~~ Then we may note the 

following facts: 

(4) For any n > n/, f\Z nf ]Z n = f\Z n , 
and 

(5) For any h : A — ► A f , f^hof < Ttf. 
We may now define the functor T : Setc — ► Setf et * by 

jr(Z) = 

^"(Z C Z) = A h-* \f G A [Z] ./1Z, 

To show that ^"(ZCZ) is a natural transformation is to show that the following 
diagram commutes for any h : A — ► A f : 



A [Z] *J-T\« ) A [Z] 



Xf.hof 



Xf.hof 



A m , A ,[z] 
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But this is just Fact 3b: (hof)]Z = ho (f]Z), for each / G A™. 

We must check that T is a functor. It is immediate that T{Z~Z) = the 

identity natural transformation. For a composite inclusion Z C Z C Z of sets, we 
have T{ZCZ) = f(ZCZ) ■ T(ZCZ) because f]Z]Z = f]Z for any / 6 

It remains to prove that T preserves any l.u.b. as a colimit. Let Zq C Z\ C • * • 

be any t^-chain in Setc with Z = |J n Z n . Fact A. 3 gives that i* : Setf et * — > 

I Set I 

Set* e creates colimits, and so by Lemma A.2 it will be sufficient to prove that 
i* o JF preserves Z as a colimit. That is, for each A G Set*, we shall prove that 

A^ with cone (Xf G A^ Zn hf]Z) n is a colimiting cone on the base A^ o] A/ - /1Zl > 
^[Zi] A/ ' /1Z2 ) . . . t To this end, let a = (a n : — > X) n be any cone on the 

same base with vertex X. Let fi(a) denote the morphism Xf.a nf (f\Z nf ) : A^ — ► 
X. We claim that fi(a) is the unique mediating morphism to the cone a, showing 
that A^ and its cone are a colimit. 

First we must show that fi(<j) mediates: For any n and any / G A^ Zn \ let n f 
denote n^z- By its definition, n' < n. Since a is a cone, the following equation 
holds for f\Z n r. 



(6) a n (f\Z n >]Z n ) = <Jn>(f\ Z ri)- 

By Fact 2, / = f\Z n r]Z n . By Fact 1, f]Z\Z n * = f\Z n >. Consequently, Equation 6 
is the required mediating equation as follows: 

(7) a n (f)=a nf (f]Z\Z nf ). 

Now to show uniqueness of n(a): Let g : vl^ -»• X be any mediating morphism. 
For any / € ^4^, since g mediates and since / = f\Z n AZ, the following equations 
hold: 

(8) 9(f) = 9{J\Z nf ]Z) = (g o (Xf.f]Z))(f\Z nf ) = <J nf {f\Z nf ) = n{a)tf), 
that is, g = i^(cr). 

It will be useful to spell out the effect of composite functors TZ\ o • • • o TZ^ 
on morphisms of Set* and of composite natural transformations T{Z\QZ\) o - - - o 
F{Zk^zZk) on objects. Restating the definition of T somewhat redundantly, we 
may write, for any one set Z G Setc, for h : A — > B in Set*, for / G A^ z \ and 
for z G Z^ 

TZhfz = h(fz). 

This is the case k — 1 of the following generalization to composites: 

Proposition 3.1, For Z±, . , Zu G Seto for h : A — ► B in Set*, for f G 

A&d" 1J , and for z\ G Z i? ... z fc G 

(9) [TZi O • • • O jTZfcJ/i/z! • • • Zfe = • • ' JZfc). 

Proof. By induction on For = 0 we have / G A, the composite functor is the 
identity Iset* 5 and (9) is simply 

heuhf = hf. 
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For k > 1 we have 

\TZ\ o • - - o J r Z k ]hfz 1 --z k = o • - - o TZ k ^{TZ k K)fz x • • • 

= TZ k h(fzi • • • z k -\)z k 

= Kfz 1 • - -2 fc ). □ 

Formula 9 should be familiar from the theory of combinators: it gives the effect 
of A;- fold iteration of the composition combinator B. 

We may similarly write out the definition of ^(ZCZ), for Z C Z G Setg, for 
A G Set*, for / G A^, and for zGZ: 

?(ZCZ) A fz = (f]Z)z. 

Again, this is the case k = 1 of a generalization to composites: 

Proposition 3.2. For C Z i? % = 1, . . . , k, for A G Set*, for f G A^ z ^'" [Zl \ 
and for z\ G Zi, . . . , z k G Z*, 

[^(ZiCZx) o ■ ■ - o T(Z k CZ k )} A fz l • • • z fc = (• • • (/1Zi)z! • • • 1 Z fc )z fc . 

Proof. For = 0, whence / G -A, the composite natural transformation is the 
(horizontal) identity, lset* : ^Set* ~~ ~* ^Set*? and the asserted equation is merely 

(lset Ja/ = 1a/ — /. 

For k > 1, write as a shorthand 5 for .FZi o • • • o TZ k -\, S for ^"Zi o • • • o TZ k ^\, 
and a for T{Z X ^Z X ) o - - - o ^(Z^-iCZfe,!) : 5 — > 5. Then 

[cr o ^"(ZfcCZfc)]^/^ • • • Z k = [S(T(Z k CZ k ) A ) o <Tr(z k )A]fzi - -Zk 

= S{T{Z k QZ k ) A ){(jjr {Zk)A f)z 1 ->-z k 
= J r (Z k CZ k ) A (aF(z k ) A fzi ■ • * Z k -i)Z k 

= {{&T{Z k )AfZ\ ' • * z k-l)] Z k )z k 

= ((•■■ (/1Zi)zi • --IZk-Jzk-ilZtizk 

respectively by definition of horizontal composition [7, p. 43 eqn. (2)], by definition 
of function composition, by Proposition 3.1, by definition of F{Z k C.Z k ) A , and by 
induction. □ 

Products and the functor 1Z 

m 

We recall from Section 1 the definition of the modified Cartesian product, f| #3 
of m sets with base points, for any m > 0: 

m m 

n.Ai = n^-{(*A,>r=i}uw. 
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m m 



We take it to be so clear as not to need a written-out proof that Y[ and FL are the 
object parts of two m-ary product functors on Set*, naturally isomorphic via the 
evident 



m rn 

A 



2=1 7-1 



: {a i ) r [L 1 i — ► (a>i)iLi otherwise, 



m 



and that the action of J| # on morphisms of Set™ is given by 



m 



(IL ^;)(ai, . . .,a m ) # = (/ii(ai),...,/i m (a m )) 



2 = 1 



m m 



Note that JJ # hi is a base-point-preserving map, as is Yl hi (same definition without 

4=1 Z=l 



m 



the spots); also that we have defined exactly one value of JJ» hi for every element 

2=1 

m 

of n«^i? because — * is a one-to-one correspondence. We will similarly take the 
liberty of writing A-abstractions, when convenient, in the form A(ai, . . . , a m } # 



We give the proof of continuity of J}. as a reminder that the restriction to finitary 
products is essential: 



m 



Proposition 3-3. Yl 9 : Set™ — > Set* is a continuous functor. 

Proof. We prove first ^-continuity on objects. For z = 1, . . . , m, let A^o Q An C • • • 
be an ascending u;-chain in Set* (necessarily sharing a common base point). Then 

TTL 

H.[J A in = {(ai,...,a m ) # | a. € (J^in, i = l,...,m} 

= {{ai, . . . , a m } # | 3ni • • • 3n m . ai 6 A • • • A a m G A mfcm } 
= {(ai, . . . , a m ) # | 3n . a; € im 5 i = 1, . . . , m} 

Then, to show preservation of inequalities, suppose that Ai C Ai, i = 1, . . . , m, 
and recall that as a function, an inclusion sends any element of Ai to itself as an 
element of Ai\ hence we may calculate 

m _ /in 171 _ \ 

U.(AiCAi) = A(ai,...,a m )V(ai,...,a m ) # = ( EL^^IL^)' D 

i=i v ?;=i %—\ ' 



m 

The finite products in Set* given by f| # yield — see, for example, [7, IIL5 ex. 5] — 
a "pointwise" product of any m functors Gi, ... , G m 6 Setf et * which we denote 
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m 

by II.Gi: 

7-1 



m 

m 



\=1 7 7-1 



m 

m 



(tl.Gi)h = n m (Gih) 

\=1 7 z=l 



m 

m 



Moreover, it is immediate that JJ # Gi is continuous because J\ 9 is an d the Gi are; 

i=l 

thus Set^ et * has finite products. 



m 



Now it follows, by [7, III. 5 Proposition 1], that Yl m is (the object function of) 
an m-ary product functor in Setf et *; the arrow function also works "pointwise" : if 
Ti : G{ — > Hi, i = 1, . . . , rn, then 



m 

m 



Finally as to products, we have 



m 

m 



Proposition 3.4, fl. : (Set* et *) — > Setf et * is continuous. 

Proof. Continuity on objects: For i = 1, . . . , m, let Gio Q Gn C • • • be an cu-chain 
in Set* et *. Then for each object ^4 of Set*, 



m 

( n. U G ») A = n. (U G in )A = n. (U G ^ A ) 



n n n 

vn . m 

= U.n. G m A = u( n. G in i) = (u n. c iB ) a, 

ti 1 n ^ n ^ — ^ 

and for each morphism /i : A — ► B of Set*, we may make an identical calculation 

m 

with h replacing A throughout. (For the central equality, recall that as a 
continuous functor, preserves l.u.b.s of a;-chains of morphisms as well as of objects.) 

* - - 

Preservation of inequalities: If Gi C G;, i = l,...,m, then for each object 
A G Set*, 

m 

n.(GiCGi) M = Tl.(GzCG,)A = U.(GiACGiA) 

1=1 J 2=1 1=1 

771 771 771 771 

= ( ff. GiAc n. g z a) = ( ( n. ac ( n. ^) = ( n. o t t n. gAa. □ 

v i=i i=i 7 v \=i ' \=i 7 7 \=i i=i 7 
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We are now in a position to observe that the functor 1Z : Set^ et * — ► Setf et 
although most perspicuously defined pointwise: 



m 



(KG) A = n. G^A, 

(HG)h = Y[.G {ki) h; i=1 

i=i 

m 

may in fact be built by composition from the continuous functors [J # and fc;-fold 
iteration: 

m m 
i=l 1=1 

and is therefore itself continuous. 

Laws of exponents as natural isomorphisms 

We turn next to the reinforcement of Propositions 1.1 and 1.3 with categorical 
sinews. 

m 

Proposition 3.5. The one-to-one correspondence ^ : J| # A^ Xi ^ = i^^i of 

7 = 1 

Definition 1.2 is natural in A (regarded as an object of Set*) and in Xi, . . . , X m 
(regarded as objects of Setcj. 

Proof. We show naturality in A first; that is, for fixed sets Xl, ... , X m and any 
morphism h : A — > 5 in Set*, that the diagram 



m M 



i=l 



77X 



2=1 

commutes. (For compactness, we shall omit writing object arguments such as A 
and (X x , . . . , X m ) for /i^). 

m 

We may write out the definition of fi^ : ]J m A^ Xi ^ — > A^=i ^1 explicitly: 

i=i 

/^m(/l) * ■ * ' /m)" = x).fiX . 

Then, for any #i G . . . , # m G A^ Xm \ 

m 

[vZr 0 Yl. hlXl] ](9l,---,9m)' = »m{h o g u . . . , h o g m )' 

i=l 

= \(i,x).[hogi](x), 

and 

F- Xl] ° /C] (ffi. • • • = X > ] (\{i,x). 9i x) 

= ho (\(i,x).gix), 
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which is the same function. 

This has shown naturality in A, that is, for fixed X±, ... , X m , that 



771 



m 



i=l i~l 



Now for naturality in Xl, ... , X m we need, supposing X{ C Xi for ii 
commutativity of 



rrt 



n. A/./ufi 



m 



A/./1 £ 

1=1 



i=l 

For G i = 1, . . . , m, we find 



T71 



m 



i=i 



= (A(z,a;).^x)1 £™ x X; = [(A/./1 EHi^i) o ^}(gi, . . . ,g m ) m . □ 



Proposition 3.6. The one-to-one correspondence v k : A^ Zk ^'" 1 = ^4^1=1 z ^ of 
Proposition 1.3 is natural in A and in Zi, . . . , Z k for any k > 0; that is, v k is a 
natural isomorphism of functors from Set* to Setf et * . 

Proof. Recall that for g £ A^'" 1 we have 



^(5) = . . . , z k ).gz 1 --Zk. 

We first show naturality in A, that is, that for fixed Z\, ... , Z*. G Setc, and 
/i : A — > 5 in Set*, the diagram 



A/A21 ...\zk.h{f z\ - -Zk) 



[Zi] 



XfXz.h{fz) 



commutes. (As with //^ in Proposition 3.5, we have not written object parameters 
for v k . The label on the left side arrow comes from Proposition 3.1.) For any 

f e ' 1 , this works out to the true equation 

\{z u . . .,z k ).h(fz 1 ---z k ) = ho (A(zi, . . . ,z k )J z ± • • -z fc ). 



Thus z/fc : o • • • o TZ k — » T{Z\ x • • • x Z k ) is natural. 
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Now for naturality in Zi, . . . , suppose Zi C Zi for i = 1, . . . , A;, and for 
any A G Set* consider the diagram (the label on the left side arrow comes from 
Proposition 3.2): 

A [Z k ]-l Z i] "k ) ^[ZiX-xZjfc] 



J^Z k \ 



Xg.g](Z 1 x---xZ k ) 



[Zi] 



To show commutativity of this diagram is to show, for any / G A^^'" 1 , that 



\{z x , . . . ,z k ).{- -{f\Z 1 )z 1 - ^]Z k )z k = (\(zu... i z k ).fz 1 ---z k )MZ 1 x x Z k ). 

For any A;-tuple (2i,...,2jt) G Zi x • • ■ x Zfc, both sides of this equation yield 
/ z\ • • • z k . On the other hand, if {zi> . . . , z k ) G Zi x • • • — ZiX * • • xZ k , then 
the right-hand side of the equation yields Let j be the smallest index such that 
Zj € Zj — Zj] then the left-hand side also is 

(' * ' ((*A[^]- [z i+ij)1^i+l)^j + l ' m *]Zk)Zk 

= (' ■ ' ((^J + 1 • * • AZfc-*A)1 Zj + ijZj + x • • • 1 Zfc)z fc 

= * A . □ 



Combining the results of Proposition 3.5 and Proposition 3.6, we obtain: 

Proposition 3,7. The mapping ip kl ... kni , given for gi G A^ Zik ^'" %1 (i — 1, . . . , m) 
by 

is an isomorphism 

^,..^:n.^ ] -" tZil] = ^ Er = in ^^ ] 

1=1 

natural in the ki + • - - + k m sets Zij and in A. 

Proof. When natural isomorphism //^ (Proposition 3.5) is written without parame- 
ters, we have the following explicit expression giving functors from Set™ to Set* et * 
as its domain and codomain: 

m 

m 
m 

Here J 7771 denotes T x * ■ - x T with m factors, and ^2 denotes the ra-ary coproduct 
functor in Seto Consequently, writing fj for fc^-ary Cartesian product in Setc, 

^°(nx---xn):n.°J rm °(nx---xn) = ^oE°(nx-xn) 
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is a natural isomorphism of functors from Set^ 1 x • • • x Set^ m to Setf et * . 
Similarly Proposition 3.6 gives, for i = 1, ... , m, 

v ki :(o ki )oT k <^Fof[, 

displaying two functors from Set^ to Setf et * as explicit domain and codomain for 
Vki 5 and so we have by composition and m-ary product the natural isomorphism 

m m ra . , 

n.°("feix---x*o ■u.°{°k 1 oT^x---xo krn oT k -) = n.°(^ o nx---x^°n) 

again of functors from Set^- 1 x • • • x Set£- m to Setf et * . Here the natural isomorphism 
v kl x • • • x Uk m is defined to be the componentwise mapping 

(Yl, . . . , Ym) H-* (VkiYli ■ • • j VknYm) 

where each Y{ is a fc;-tuple of sets. 
But 

(n x ••• x n) = (foflx ••• x j-o n), 

so the above two natural isomorphisms are composable, yielding 



d f 

(io) Vfci-fc™ = K ° (n x • • • x n)) • (n. ° * • • • x io) 

. 777/ fc ^ fcj yti 

■ n. ° (°fci ° ? ki x ■ • • x o km o T k ~ ) ^ o £ o (n x . . . x n) . 

Reverting to consideration of arbitrary objects Zij , j = 1, . . . , fc*, i = 1, . . . , m 
of Setc and A of Set*, we may work out the effect of ip kl ... km (((Zirf^Zi) 



ra 



[Z i] 

on any element . . . , # m ) # of JJ # A^ z%k ^ " 11 (as usual we suppress the object 
arguments): i=1 

= ^(^(^b • ' • ? Z ki )-9l Zl'-Zkn • • • , A(zi, . . . , Z krri ).gm Zi'" z km )* 



Construction of j as a least fixed point 

In developing ^fci— fc m > we deliberately made provision for fci H h A; m separate 

sets Zij-; this was with an eye to facilitating the treatment of many-sorted term 
algebras in Section 4. Our need at the moment, however, is to come down to one 
set Z. We note the following fact about diagonal functors. For any functor F and 
any k > 0, 



(ii) 



F k o A k = A fc o F. 
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Now observe that we may express our term-algebra functor T : Setc — > Setc 

as 

T = E°(nx---x n)°(A fcl x-.-x A fcm )oA 5 



and the trie-building functor 71 : Setf et * — ► Set^ et * as 



m 



n = II. ° (°fci x - • x o km ) o (A fcl x • . • x A km ) o Am. 
Using Equation 11, we may express the composite 71 o T as 



m 

i r i i / \ t v . . . y / \ t t r\ / \ 



n 0 T = EI. ° (°fci x • • • x °k m ) o (F kl x • • • x JF fc ~) o (A fcl x • • • x A fcm ) o A, 



m 



= fl. 0 ((°fci o J^ fcl ) x • • • x (o fcm o jr fc ™)) o (A fcl x • • • x A km ) o Am. 
If we define the composite 

* = Vfci».fc m 0 x - x A fcm )oA m , 

then Proposition 3.7 and the explicit expression for the type of ^^...^ given by 
Formula 10 yield 

Proposition 3.8. We have the natural isomorphism of functors from Setc to 
Setf et * 

For Z e Setc and A e Set*, the effect of : fL^? 0 ^) = x z ^ (A) is 
given by 1=1 

We may now begin the construction of 7 : i? = T(T). For brevity write J for 
the identity functor J Set s e t* . Define what we will show in a moment is a continuous 
endofunctor of TIT by 

1{T — + T\T 

<G, Z, r : G — + TZ) i— ► <ft(G), T(Z), • ?e(r)) 
(<r : G — * G', (Z C Z)> 1— ► (71(a), (T(Z) C T(Z))) . 

Note that ^ • 7Z(r) is an isomorphism if r is. 

By Lemma A. 5, C is a well-defined functor, taking D = X, E — T , G — R = TZ, 
p — It^, H ~T , and cr = \& in the statement of the lemma. 

Recall (from Proposition 2.10) the forgetful functor P : Jj^ 7 — > Setf et * x Setc- 
Since 71 and T are continuous, P o C — 7Z x T is continuous. Consequently, 
Proposition 2.10 gives that C is continuous. 

Now we will construct a least fixed point (J?, T, 7) of C. The functor _L. is initial 
in Setf et *: for any G G Set^ et * we have the unique 

(A 1 — ►(•!-► : ^. — ^ G . 
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JF(0) is also initial, as shown by the natural isomorphism 

7o d J f (A ^ (• h+ Xx G 0 . * A )) : J_. J^(0) . 

By initiality, and the fact that every !Z(G)(A) is a spot product, and so contains 
•, we have the commutative diagram 



-1. = ► K(±.) 



(12) 70 



JF(0) ► HW)) ■ 

JF(0CT(0)) 

Let Rq = and To = 0. For brevity, write E for the inequalities (C, C) of Xj^ 7 . 
Then (12) is 

(i? 0 ,T 0 ,7o) E C(i2 0 ,r 0 ,7o} 
drawn as a diagram in Setf et *. It allows us to generate the ascending o;-chain in 

(13) {#o,To,7o> E (RuTu-yi) E (^2,T 2 ,7 2 ) E 

where (R n ,T n ,j n ) = C( n )(_L.,0,7 O ) or, in detail, 

i?n+l = fc(Rn), r n+ i = T(T n ), 7 n+ i = *T n ' ^7n) • 

Then Proposition 2.8 gives that (13) has a l.u.b. (it!, T, 7) where i? = Un^™> 
T = U n T n , and (since, by induction, each 7^ is an isomorphism) 7 : i? = !F{T). 

Considering C merely as an u;-continuous mapping on the objects of T\T we 
have found 

( J R,T, 7 ) = |JcW(±.,0,7o). 



n 



We pause to note the following fact about epos, useful for finding least fixed 
points when there is no least element. 

Fact 3.9- In any cpo K, if a function F : K — ► K is u) -continuous, and ifkeK 
is such that F(k f ) □ k for all k f , then F^(k) is the least fixed point of F. 

To see this, recall the familiar fact that |J n F^ n \k) is the least fixed point of F 
above k; the hypothesis ensures that every fixed point of F is above k. 

Now we may observe that _L. C U(G) for every G G Set^ et * and 0 C T(Z) 
for every Z € Setg, so the initiality of J_. gives that (_!_., 0, 70) E C(G, Z, r) for 
every {G^Z^r) G T\T. Hence, by Fact 3.9, (i?,T, 7) is the least fixed point of C. 
Moreover, if (i?, T, 7') is any fixed point of C connecting R to T, then Fact 2.9 gives 
that 7' = 7. In other words, 7 is the unique natural transformation from R to 
T(T) satisfying 

7 = \Er T . ^(7) . 
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(The uniqueness of 7 can be proved using only the initiality of (i?o,2o,7o) [3, 
Proposition 2.31], but the partial order in I[T simplifies the proof here. Reynolds 
noted a similar uniqueness for the relational functor which he constructed in [10].) 

Recovering the look-up algorithm 

We may now unpack the fixed-point equation characterizing 7 to recover the 
look-up algorithm "ap" which was written down but not justified in Section 1. 
Let A £ Set* be any pointed set and let (ri, . . . , r m )* £ R(A) be any T-indexed, 
^4-valued generalized trie. We have 

rn 

7A(ru . • ■ , TmY = (*r • fl.7 (fc0 ) A (n, . - . , r m ) # 

1=1 

m 

= {^TAoU.(l {ki) )A)(ri,...,r m )' 

i=l 

= (A(</i, . . .,5m)* . A(i, . . . ftki)} • 9iti* - - tki) 

((7 (fcl) )Ari s ...,(7 (fc ™ ) )Ar m >* 

(14) - A<», <*i, . . . , f fc . » . (7 (fc °)Ari *i • • • h t . 

Recall that for any functors F, G : M — > L and F' , G ; : L — ► X and any natural 
transformations r : F — ■+ G and r' : F' — G', the horizontal composition 
r* or \ F f o F — > G' o G is given [7, p. 43 Formula 3] by 

r'or^G'or) - (r' o F) . 

For endofunctors F,G : L — > L, r : i* 1 — ■+ G, and any A; > 0, this yields by an 
easy induction the formula for : F< fe > — + G< fe > 

(15) r<*> = (G**" 1 ) o r) • (G^~ 2) oroF).....(Goro F (/c_2) ) • (r o F (fc-1) ) . 
So for any k > 0 and r G i?( fc )(A), we have, since 7^) : — > 

( 7 ( fc )) A r = o 7 ) • {T^~ 2) o 7 o R) ■ . . . ■ (JP T o 7 o i?( fc " 2 )) - (7 o i^- 1 ^ r 

= [O^t (7a)) o (?t (liHA))) 0 • • • 0 (^r(7fl(*-a)(A))) 0 7h(*-d(a)] r 
= _1) (7A)(^r ~ 2) (7fl(A))(- ' • (^r(7fl<*-»>(A))(7ji(*-i>(A) r )) •••))• 

From Proposition 3.1 we have, for any j > 0, for an appropriately typed morphism 
h of Set* and function /, and for ti, ... , tj £ T, 

rffhf h---tj = h(f t ± --- tj) . 
Let ti, . . . , tk be any terms, and apply this in turn for j = k — 1, k — 2, . . . , 1, 
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yielding 

{l {k) )ATt^^t k 

= " 1) (7A)(J r r ~ 2 \lR{A))(- • * (^T{lRik-2) {A) )(^ R(k -i) {A) r)) • • • ))tx ■ - - t fc 

= 1a{Ft ~ 2 \lR{A)){- - • (^r(7H(fc-2)(A))(7i?(^-i)(A) r )) • ■ ' )*i * ' **fc-i)*fc 
= 7a(7h(A)((* * * (^r(7H(fc-2)(A))(7H(fc-i)(A) r )) "••)*!' * 'tk-2)tk-i)t k 



= 7a(7h(A)(* • ' (^ r T(7H(fc-2)(A))(7^-i)(A) r )*i i 2) • • • )tk-l)tk 
= 7a(7h(A)(' ' *7 J R(fc-2)(A)(7^(^-i)(A) r *i)*2 * * ' )tk-i)tk . 

So Equation 14 becomes 

7a(^i, . . • , r m ) # = A(z, (t x , . . . , t fc .» . 7a(7a(A)(- ' * (7H(fc-D(A) r <i) ' 1 ' ■ 
Since 7^ is base-point preserving, we may expand this as 

1a • = *At T ] = . *a 

7a(^i, • • • , r m ) = A(i, (ii, . . . , t fc . )) . 7a(7*(A)(* ■ • (7H(*-i)(A) r <i) • ' ' )*fc,-i)*fc i . 

This is precisely the recursive definition proposed in Section 1 for ap. 

A more realistic set of terms 

In a typical application of generalized tries, such as to a table of common sub- 
expressions in a compiler, one would be likely to find that the "terms" to be looked 
up were not quite an instance of the term algebra T we have been discussing, but 
rather were defined by an equation like 

T f — T ,kl + h T /fem + V 

with V being a large, possibly infinite, set of unstructured elements such as iden- 
tifiers or numerals. We sketch here how any reasonable (that is, functorial) data 
structure for ^-indexed look-up tables can be incorporated with the trie idea. 

Suppose then that T" is as just described, that is, the least fixed point of a 
functor T' : Setc — > Setc defined by 

T'(Z) = Z kl H h Z km + V, 

We suppose that a functor B : Set* — ► Set* encapsulates some data structure for 
F-indexed tables, its look-up function being a natural transformation (3 : B — ► 
F{V). (For example, B might assign to each pointed set A the set of all V- 
indexed, A-valued binary search trees. With such possibilities in mind, we refrain 
from supposing that ft is an isomorphism, which would be to suppose a unique 
representing data structure for each finite function.) 
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The elements of R!(A), the T'-indexed, A- valued tries, will correspondingly have 
an m + 1st field in each tuple, containing an element of B(A) or of B(R f (A)) or 
.... That is, we define 

n'(G) = G {kl) x. ••■ x. G (km) x. 5, 
n\r) = r< fcl > x. ... x.r< fem > x. 1 B . 

Suitable adjustments to the constructions used for Propositions 3.7 and 3.8 above 
will then produce a natural transformation (not an isomorphism unless (3 is) 

with the effect of 

V ZA : Yl.f { z ki) (A) x. B(A) — > ^ Er=i ^ +V (A) 

i = l 

given by 

• ■ • > 0m, &}" = Mm+^fei • * ■ J V km (9m), /?a(&)> # , 

that is, 

*za<01i • * -i9rn,b) 9 (i, (z u . = # *i ■ ■ ■ if i < m, 

The construction of C : Tj-F — ► Jj^ 7 and its least fixed point (i?', T", 7'), with 
7' : i?' — ^ T(T f ) and 7' = \&^/ • H'iY), may then proceed as before. 

Section 4. Many-sorted Tries. 

In extending the result of Section 3 to many-sorted term algebras the greatest 
difficulties are notational. For many-sorted algebras we follow Goguen, Thatcher, 
Wagner, and Wright [5] in substance, although our notion of signature is arranged 
differently from theirs in order to follow our treatment of the one-sorted case more 
closely. 

Let 5 be a set of "sorts" . We call a finite sequence of sorts, that is an element w of 
5*, the free monoid over 5, a "word" on 5; we denote its length as A signature 
for an 5-sorted algebra should provide for each operator a result sort and a finite 
sequence, that is a word, of argument sorts. It is necessary to our trie construction 
that for each s G 5, only finitely many operators have result sort 5 (this is our only 
substantial departure from the notion of ^-sorted algebra in [5]). Accordingly we 
suppose that an ^-sorted signature is a pair (m, k) where m is an 5-indexed family 
of non-negative integers and, for each s E S, «($) : {1, ... ,m(s)} — ► S*. For 
s e S, 771(5) is the number of operators having result sort s, and for 1 < i < 777(5), 
K(s)i gives the arity (a word on S) of the ith operator of result sort 5. 
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To give as familiar an example as possible of a two-sorted term algebra, we take 
ordered trees and ordered forests, with the tree nodes again labeled by a single bit; 
the mutually recursive definition is 

Tt = T F + T F , 
T F -{()}+T T xT F . 

We would like the Tt- and Tp-indexed tries to come out satisfying the corresponding 
equations 

R T (A) = R F (A) x R F (A), 
R F (A) = Ax R T (R F (A)). 

As is well known, ordered forests are in one-to-one correspondence with binary 
trees. Following the example may be facilitated by observing that a Tp-indexed 
trie comes out as a reformatting, via the template { , , ) »-> ( , { , )), of a 
Te-indexed one-sorted trie. 

The same two-sorted syntax of terms may be expressed more opaquely according 
to our definition of many-sorted signature by the choices S — {t, f} (for "tree" and 
"forest", not the truth values), ™(t) = m(p) = 2, k(t)i = k(t) 2 = F, «(f)i = e 
(the empty word), and k(f)2 = tf. 

We follow [5] in a convenient generalization of the exponential notation: if X is 
an ^-indexed family of sets and w 6 5* is a word, w = W\W2 • - .Wk say, then X w 
denotes X(wi) x • • • x X(wk). 

Fixing now on any arbitrarily chosen 5, to, and «, we may make the construction 
of the term algebra look very much like the one-sorted case. Define the functor 
T : Set£ -* Set£ by, for each s e S and Z € Set£ , 

m(s) 

T(Z)(s) = £ Z K ^. 
Then let T € Set^ be given by 

r = |Jr (n) (Ases.0). 

(Slightly abusing notation, we write plain C and (J for the componentwise extension 
of inclusion and union to S- indexed families of sets. In particular Z C Z 7 , where 
Z and Z f are i^-indexed families of sets, if and only if Z(s) C Z f (s) for all 5 G S. 

Analogously we extend C and (J to /S-indexed families of functors.) 

Similarly generalize the notation for n-fold composition: if G is an 5-indexed 

family of endofunctors of Set*, that is an object of (Set^ et *) , and w is a word 

on S with \w\ = A;, let C?(™) denote G(w{) o « • • o G(wk)> Likewise, if r is an S- 
indexed family of natural transformations of such functors, that is a morphism of 

(Setf et *) 5 , let r<™> denote r(«/i) o • • • o r(w fc ). 



30 



RICHARD H. CONNELLY AND F. LOCKWOOD MORRIS 



Now we may define i?, the T-indexed trie functor, much as before: 

* 

R = |J^ (n) (As G S.±.) 

n 

where K : (Setf et *) — ► (Setf et «)° is given, for objects and morphisms G and r 
of (Setf et *) S and for a G S, by 

rn(s) m(s) 
i=l i=l 

There is nothing new in the verification that T and 1Z are continuous endofunctors 

of Set£ and (Setf et *) 5 respectively. 

The sets of terms and of tries which for readability we called Tt, etc. have official 
designations T(t), etc. 

Using the notation of [7, p. 45], we have the continuous functor 

J*: Set? — (Set^)^ 

r s (z)(s) = F(z(8)), 

T S (Z C Z')(«) = ^(Z(«) C 

for 5-indexed families of sets Z, Z' and for s £ S. 

Applying once more the notational idea from [5] of generalizing from non- negative 
integers to words, we may, for any category L and w G S* with \w\ = A;, define 

A w : L s — 

for both objects and morphisms of L 5 . As earlier with A n , we shall leave the 
category L to be determined by context. 

Now, combining for each result sort its own version of the natural isomorphism 
V'fci'-fcm introduced in Proposition 3.7, we may define a natural isomorphism \& of 

functors from Set£ to (Setf et *) 5 , 

by, for any Z £ Set^ and s E 5, 

*z00 = [^|it(.)i|-|«(aHs<.,| 0 ( A «Wi x ' ■ * x Ak( s) _ (s) ) o A m(s) ] z 
: 7^ 5 Z)(s) 2 F s (TZ)(s) = F(TZ{s)). 

For ^4 € Set*, the effect of ^z(s)a is given by 

:lVf) (K(s) ^ =TlV ((K(s) *^ ( ^ |)] -" tZ(WS)i)l)] — ^> z . (s)i (A), 
i=i i=i ^ i=1 

^z(s)a(«/1) • • • 5 9m(s))' = Mh (zi, ■ ■ ■ , ^^(s);!)) • #i zi • • • z Ks);| • 
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Write X s for the identity functor on (Setjf et *) 5 , Then we may define a contin- 
uous endofunctor of the comma category X S \T S by 

X s [T s — > X s {T s 

{G,Z,r:G — + T s Z) i— ► (K{G), T(Z), V z • K(r)) 
(a:G—^ G", (Z C Z')> »— ► <ft(<r), (T(Z) C T(Z'))> . 

Lemma A. 5 shows, taking D — I s , E — JT 5 , G — R ~ TZ, p = 1^, H — T, and 
cr = that C is a well-defined functor; C is continuous because 7£ x T is. 

The ^-indexed family _L. = f As . _L. is an initial object in (Setf et *) 5 , and we 
have the natural isomorphism 

7 0 d 4 f As . 70 : X. S As . JT(0) . 
We obtain, as in the one-sorted case, 

7*0 /T 



(i?,T,7)-LJc W (i.,A 5 .^(0),7o), 



n 



with 7 a natural isomorphism, and the unique natural transformation satisfying 
7 = %'^(7)- 

To uncover the look-up algorithm in 7, we may begin by writing an analogue to 
Formula 14, for s G 5, AG Set*, and (n, . . . , r?n(s)) G i2(s)(A) any T(s)-indexed, 
A- valued trie, 

(16) 7 y(s) A {ru . . . , ^m( s )) # = A(i, <ti, . . . , *|«^>^|>> • (7 ( * (fl)i) ) a*"* *i * * * * 

The formula for horizontal composition generalizes just as well to k pairs of functors 
which need not all be the same; so for any category L, any two S-indexed families 
F and G of endofunctors of L, that is any two objects of (L L ) S , any morphism 
r :F— >Gof (L L ) S , and any w 6 S* with |w| = k we have the 5-sorted analogue 
of Formula 15: 

. . . • o r (w 2 ) o F {w3 ~ Wk) ) ■ (t(wi) o F^- WkS >) . 

Thus for r 6 we have, since 7^) : — ■» (f§) {w) , 

... (J-l)^ 1 )^^)^^— fc ) (A) )(7(^i) 5 (- 2 •■•))• 

For any prefix w~ = w\ • • -Wj of w, for any morphism h : B — > jB' of Set,, for 
/ e B^K-)!"" pr(, " l)1 j an d f or t x e T(wi), . . . , tj € T(wj), Proposition 3.1 yields 

(f^ w ~ ) hft 1 ---t j = h(ft 1 ---t j ). 
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Then for A G Set*, r G R (A), and U 6 T(wi), i = 1, . . . , fe, we obtain 
(7 ( ™ } )a^i = 7(wfc)A(7(wjfe_i) g ( Wfc ) (A) ( . . . 

So we may write the ^-indexed family of mutually recursive routines implicit in 
Formula 16 as 

j(s) A * = \teT(s).* A 

7(«) A (^l,..., = A(i, *!«(,).!)) .7((«(«)i)|K(a)i|)A( 

* • ' (7((«(*)i)l)g«*C-)i)2-"(«(-)i)| l «(.) i |)( A ) r *l) ' * 

This is not really as horrible as the general notation makes it look. For our example 
of ordered trees and forests, calling the two look-up functions T-ap and F-ap rather 
than 7(t) and 7(f), it works out to 

T-ap A • t = *a, 

T-ap A (r x , r 2 )(l, (/» = F-ap A n /, 
T-ap A (ri,r 2 )(2, (/)) = F-ap A r 2 /; 

F-ap A • / = * A , 
F-ap A (ri 9 r 2 >(l, ()) = r u 

F-ap A (n, r 2 )(2, <t, /)) = F-ap A (T-ap Hr(A) r 2 t) / . 

Appendix, Colimits in Comma Categories. 

Below are two theorems which show the creation of a colimit by parameters in a 
category of functors and in a comma category. The first is MacLane's "(co)limits 
with parameters" theorem [7, Theorem V.3.2] which we state here as a fact. The 
second is Bierle's theorem, which shows how to construct a colimit by components 
in a comma category [1, Fact L4], This theorem was independently discovered 
by Connelly [3] and by Casley, et. al. [2]. The presentation here uses MacLane's 
concept of "creating colimits" [7, p. 108], defined as follows. 

A functor V : K — ► M creates colimits for a functor F : J — ► K if and only if 
for any colimit cone r : V o F — -> m there is a unique object k in K and unique 
cone a : F — -> k such that V o a = r and, further, this a is a colimit cone. 

The functor V is often a forgetful functor, sending every morphism of K to itself 
as a morphism of M. In this case we may describe the creation of colimits by V for 
F as the existence, for any colimit cone r : V o F m, of a unique k e K such 
that Fk — m, and r : F —— ► k is also a cone in K, and r is also a colimit in K. 

The definition of "creates colimits" yields the following obvious fact. 

Fact A.l. Let a : F — -> k be a colimit cone created by V. For any cone rj : F — > 
the unique mediating morphism (u.m.m.) f from VoatoVorjis the image of 
the u.m.m. g from a to rj (that is, Vg = f). 

In the case of V a forgetful functor, Fact A.l says that any u.m.m. in K from 
the created colimit o to another cone on F is the same as the u.m.m. in M between 
the composites of the two cones with V. 
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The next lemma gives a condition under which a functor that creates colimits 
can be used to prove that a second functor preserves colimits. 

Lemma A. 2, Let F : J — > K, G : K — > L, and V : L — ► M be functors. If 

V creates colimits for G o F, and V o G preserves colimits of F, then G preserves 
colimits of F. 

Proof. Let a : F — -> k be any colimit cone. By hypothesis, V o G o a is a colimit 
cone. Since V creates colimits of G o F, G o a must be the colimit cone created by 

V for V o G o a. □ 

MacLane's theorem gives a way to create a colimit by parameters in a functor 
category. It uses the following notation. Let |L| denote the discrete category whose 
objects are those of L, and let i : \L\ — ► L denote the insertion functor. Define 
i* ; K L — > as the functor sending H i-> H o i for each object (functor) and 
7] h-» 7] o i for each morphism (natural transformation). 

Fact A. 3. [7, Theorem V.3.2] Let K and L be any categories. The functor z* : 
K L — > i^l 1 '' creates colimits (for any functor F : J — ► K L ). 

Lemma A. 2 and Fact A. 3 mean that for any functors F : J — ► M and G : 
M — > if i* o G preserves colimits of F, then G preserves colimits of F. This 
is especially useful because \L\ is discrete and so it is sufficient to show that i* o G 
preserves colimits for each object / of L. 

Beierle's theorem uses the following notation. Let T : L — ► K and S : M — ► K 
be any functors. For any object z G T|5, will denote the morphism in z 
(i.e., the third component). Consider the following forgetful functor and natural 
transformation (the latter is taken from [1]). 

TiS — > L x M 

P l :Ton 1 oP—>SoII 2 oP 

(l,m,f) i — > (i,m) 

: z i — > 

(w, v) i — > (w, u) 



The composites IIi o P and n 2 o P will be abbreviated by Pi and P 2 respectively. 

Here is Beierle's Fact 1.4. The statement of the theorem here is more detailed 
than the references [1, 2, 3], but the proof method is the same. 

Theorem A. 4. Let T : L — > K and S : M — ► K be any functors. Let F : J — ► 
T[S be any functor such that T preserves colimits of P\ o F. Then the functor 
P : T[S — ► L x M creates colimits for F. Specifically, if r : P o F (/, m) is 
any colimit cone of P o F, then in the created colimit cone r : F — — ► (/, m, f), f is 
the u.m.m. in K from the colimit cone T o P l o r to the cone (S o P 2 0 t) • (P; o F). 
Further, if S preserves colimits of P 2 o F, and P[oF is a natural isomorphism, then 
f is an isomorphism. 

Proof. Let r : P o F —> (I, m) be a colimit cone. Since T preserves i7i o r as a 
colimit cone, T o Z7i o r is a colimit cone. Let / : T(Z) — ► S'(m) denote the u.m.m. 
from T o IIi ° t to (5 o 77 2 o r) ■ (Pj o P). That is, / satisfies the following diagram 
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for each j G J. 



T(F(j)i) -^H T(Z) 



f 



S(F(j) 2 ) ► S(m). 

S(r(j) 2 ) 

The diagram gives that r(j) is a morphism in T IS from F(j) to (Z, m, /) for each 
j G J. Consequently, since r is a cone from PoF to (Z, m), it is also a cone from F to 
(Z, rn, /). To see that / is unique in the vertex (Z, m, /), suppose (Z, rn, /') to be any 
other vertex of r in TjS. For each j G J, the morphism r(j) : P(j) — ► (Z, rn, /') 
implies that the diagram continues to commute if / is replaced by /'. Consequently, 
/' mediates from To 77 x or to (SoZ^or^PjoP) and must be the u.m.m. /. When 
S preserves colimits of P 2 ° F and P| o F is an isomorphism, (5 o U 2 o r) • (Pj o P) 
is a colimit cone and so / is an isomorphism. 

It remains to prove that r is a colimit cone in TJ,5. Let a : F (Z, m, /) be 
any cone and let v : (Z, m) — > (f, rh) denote the u.m.m. from P o r to P o a. It 
must be proved that v is a morphism in T|5 from (Z, m, /) to (F, m, /) and that it 
uniquely mediates from r to <r. The only part of this that is any work is to show 
that v is a morphism at all, as follows. 

Since r(j) : F(j) — > (Z, m, /) for each je J and since mediates from P 2 ° t 
to P2 o <j, the following equations are valid for each j 6 J: 

5(^)o/or(r(j)i) = 5(^ 2 )o5(r(i) 2 )oF(i) i = 5(^or(j) 2 )oF(j) i = 5((r(j) 2 )oF0) r 

Since v\ mediates from Pi o r to Pi o a and since a(j) : F(j) — ► (Z, m, /) for each 
j G J, the following equations also hold for each j G J: 

/o T(i/i) o T(rC;)i) = f °T(ui o r(j)i) = foT(a(j)i) = 5(<t(j) 2 ) o 

These two sequences of equations show that 5(^2) 0 / and / o T(^ x ) both mediate 
from colimit cone T o P x o r to cone (5 o P 2 o <j) ■ (Pj o P) and so must be equal. 
Consequently, v is a morphism in T\S from (Z, rn, /) to (Z, m, /). 

That z/ uniquely mediates from to r is true simply because v uniquely mediates 
from P o a to P o r. □ 

The following corollary gives a condition under which a functor whose codomain 
is a comma category will preserve colimits. It is similar to Corollary 2.19.3 in [3]. 

Corollary A.4.1. Let H : Q — > T[S be any functor. For any functor F : J — ► 
Q, if P o H preserves colimits of F and T preserves colimits of Pi o H o F, then H 
preserves colimits of F. 

Proof. By Theorem A. 4, P creates colimits of H o P. Take V to be P and iJ to be 
G in Lemma A. 2; then H preserves colimits of P. □ 

The following diagram lemma for the construction of an endofunctor of a comma 
category, which we apply in Sections 3 and 4, generalizes a construction used in 
Chapter 7 of [3] that in turn was inspired by the relational functors defined by 
Reynolds [10]. It seems time it was recorded separately. 
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Lemma A. 5. Given five functors D : L — > K, E : M — > K 7 G : L — ► L, H : 
M — ► M, and R : K — > K, and two natural transformations p : D o G — > Ro D 
and a \ Ro E — -> E o H, the following defines a functor: 



C 



DIE — > 

</, m, /> .— (G(Z), tf(m), <r m o o Pz ) 
(g,h)^(G(g),H(h)). 



Proof. To see that C is a functor, consider any morphism {g, h) : (l,m,f) 
{V, m', /') of Dj-E as a commutative diagram in K: 



D{1) -^U £>(/') 



(*) 



/ 



E(m) 



E(h) 



E(m') 



This is sent by C to the outer rectangle of the following diagram. 



D{G{1)) D{G(9) \ D(G(V)) 



pi 



pv 



R(D(l)) R(D(g) \ R(D(l')) 



(**) 



«(/) 



i?(£(ra)) 



R(E(h)) 



R(f') 



R(E{m')) 



m 



m 



E(H(m)) 



E(H(h)) 



E(H(m')) . 



The upper pane of (**) commutes because p is a natural transformation, and the 
lower pane because a is. The center pane is simply the image of diagram (*) under 
R. Consequently, (**) commutes and C is well defined. C preserves identities and 
composition simply because G and H are functors. □ 

As we apply the lemma here, we have D always the identity Ik, with G = R 
and p = 1r, so that the definition of C reduces to 



C : I K IE — I K [E 

: (I, m, /) i * <iJ(Z), #(m), a m o #(/)) 



:(g,h)~(R(g),H(h)) 
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